Mining microorganism EST databases in the quest for new proteins.
Abstract
Microorganisms with large genomes are commonly the subjects of single-round partial sequencing of cDNA, generating expressed sequence tags (ESTs). Usually there is a wide gap between gene discovery by EST projects and the submission of amino acid sequences to public databases. We analyzed the relationship between available ESTs and protein sequences and used the sequences available in the secondary database Clusters of Orthologous Groups (COG) to investigate ESTs from eight microorganisms of medical and/or economic relevance, selecting candidate ESTs that may be further pursued for protein characterization. The organisms chosen were Paracoccidioides brasiliensis, Dictyostelium discoideum, Fusarium graminearum, Plasmodium yoelii, Magnaporthe grisea, Emericella nidulans, Chlamydomonas reinhardtii and Eimeria tenella, each of which has more than 10,000 ESTs available in dbEST. A total of 77,114 protein sequences from COG were used, corresponding to 3,201 distinct genes. At least 212 of these were capable of identifying candidate ESTs for further studies (E. tenella). This number was extended to over 700 candidate ESTs (C. reinhardtii, F. graminearum). Remarkably, even the organism with the highest number of ESTs corresponding to known proteins, P. yoelii, showed a considerable number of candidate ESTs for protein characterization (477). For some organisms, such as P. brasiliensis, M. grisea and F. graminearum, bioinformatics allowed automatic annotation of up to about 20% of the ESTs that did not correspond to proteins already characterized in the organism. In conclusion, 4,093 ESTs from these eight organisms that are homologous to COG genes were selected as candidates for protein characterization.
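The selection idea in the abstract (keep ESTs that resemble a COG protein but do not match a protein already characterized in the same organism) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: it assumes similarity hits have already been computed and are available as (EST id, subject id, E-value) records, and the names `Hit`, `select_candidates` and the cutoff `max_evalue=1e-10` are hypothetical choices that the abstract does not state.

```python
from typing import Dict, Iterable, NamedTuple


class Hit(NamedTuple):
    """One similarity hit between an EST and a protein (hypothetical record layout)."""
    est_id: str
    subject_id: str
    evalue: float


def select_candidates(
    cog_hits: Iterable[Hit],
    known_protein_hits: Iterable[Hit],
    max_evalue: float = 1e-10,  # assumed cutoff, not taken from the paper
) -> Dict[str, Hit]:
    """Return the best COG hit for each EST that has no significant hit
    to a protein already characterized in the organism."""
    # ESTs that already correspond to a known protein are excluded.
    known = {h.est_id for h in known_protein_hits if h.evalue <= max_evalue}

    candidates: Dict[str, Hit] = {}
    for hit in cog_hits:
        if hit.evalue > max_evalue or hit.est_id in known:
            continue
        best = candidates.get(hit.est_id)
        # Keep only the most significant (lowest E-value) COG hit per EST.
        if best is None or hit.evalue < best.evalue:
            candidates[hit.est_id] = hit
    return candidates
```

Under these assumptions, the number of keys in the returned dictionary would play the role of the per-organism candidate counts quoted in the abstract.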
Related articles
Searching the Expressed Sequence Tag (EST) Databases: Panning for Genes
The genomes of living organisms contain many elements, including genes coding for proteins. The portions of the genes expressed as mature mRNA, collectively known as the transcriptome, represent only a small part of the genome. The expressed sequence tag (EST) databases contain an increasingly large part of the transcriptome of many species. For this reason, these databases are probably the mos...
The Effect of Different Additives and Medium on the Bioleaching of Molybdenite for Cu and Mo Extraction Using Mix Mesophilic Microorganism
Bioleaching processes for extraction of Cu and Mo from molybdenite cons. are more environmentally friendly and consume less energy than conventional technologies, yet less economically efficient. One necessary step towards arriving at a cost-effective bioleaching process is using appropriate methodology to optimize pertinent factors in such processes. To this end, the present study employed Res...
EST mining and functional expression assays identify extracellular effector proteins from the plant pathogen Phytophthora.
Plant pathogenic microbes have the remarkable ability to manipulate biochemical, physiological, and morphological processes in their host plants. These manipulations are achieved through a diverse array of effector molecules that can either promote infection or trigger defense responses. We describe a general functional genomics approach aimed at identifying extracellular effector proteins from...
A Review of Designing New Vaccines to Prevent Hospital-Acquired Antibiotic-Resistant Infections
Hospital-acquired infections are one of the main challenges and concerns of patients and medical staff in hospitals and healthcare centers. Meanwhile, Clostridium difficile infection is one of the most important bacterial hospital infections. Prevention is the best and most effective way to deal with these infections. Designing and using vaccines against these infectious microorganisms is the b...
Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)
Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally. Objective: In the present study, we report the results of a systemic search for identification of new miRNAs in B. rapa using homology-based ...
Journal: Genetics and molecular research : GMR
Volume 2, Issue 1
Publication date: 2003